International Journal of Scientific Engineering and Technology 
„ ,<^T_ Volume No.3 Issue No. 12, pp : 1410-1413 

Power Efficient Optimized Arithmetic and Logic Unit Design on FPGA 



(ISSN : 2277-1581) 
01 Dec. 2014 



Siddharth singh parihar, Prof. Rajni Gupta 

Dept. of Electronics and Communication Engineering ,KNPCST College, India 
siddharth.siddy.singh@gmail.com, rajni_gupta2007@yahoo.com 



Abstract: This paper deals with low power ALU design and its 
implementation on 90nm Spartan 3 FPGA. Most of power is 
consumed in ALU in any processor and hence reduction in 
ALU power is needed. In this work, we have designed a low 
power ALU. To reduce dynamic power consumption we disabled 
the blocks which are not needed in currently selected operation. 
Also hardware is reused; this will cut down the FPGA resource 
usage and also reduce the power consumption. By using these 
methods dynamic power consumption is reduced and less FPGA 
resources were consumed. 

Index Terms — FPGA, ALU, low power, Hardware reuse, tri-state 
logic, dynamic power consumption 

I. INTRODUCTION 

This is an era of hand held devices and equipments, most of these 
devices runs on battery, this puts a constraint on standby time, to 
increase standby time more and more battery life is needed, one 
way of solving this issue is to reduce power consumption of 
device or equipment. These days almost every device is 
intelligent, this intelligence came from using processors, and in 
forthcoming years this trend is likely to be increase. But these 
processors consume lot of the power of device as lot of switching 
activity is going inside. ALU (Arithmetic and Logic Unit) is the 
heart of any processor; this also consumes most of the processor 
power. In this work we worked in order to reduce power 
consumption of ALU. We have designed an eight bit optimized 
ALU, the size of the ALU can be easily increased to 16, 32 or 64 
bit. A two level optimization is implemented, first we have 
reduced the FPGA resource consumption by reusing them for 
different operations, details are given in forthcoming sections, this 
will cut down FPGA resource consumption and also power 
consumption of design, several blocks are designed to implement 
specified 16 operations, in second level of optimization we enable 
only one block at a time which is currently selected and all other 
blocks are disabled, this reduces dynamic power consumption of 
device and makes our design more greener. 

Section 2 of the paper deals with the design of optimizeALU. 
Section 3 is having results. Section 4 contains concluding 
remarks of this work. Section 5 contains future scope. 

II. DESIGN OF ARITHMETIC AND LOGIC UNIT 

The inputs to ALU are A, B (operands), Clk (clock), selection 
(to select one operation out of sixteen operations). Outputs 
from ALU are Z (result) and flags. The steps in designing 
ALU are discussed in next sections. 



A. Operations 

Our design support sixteen operations, these operations 
are listed in table 1 . 

TABLE I 
ALU OPERATIONS 



Selection 


Operation 


0000 


Clear 


0001 


Hold B 


0010 


Complement B 


0011 


Hold A 


0100 


Complement A 


0101 


Decrement A 


0110 


Increment A 


0111 


Shift Left A 


1000 


Add (A + B) 


1001 


Subtract (A - B) 


1010 


Add with Carry (A + B + 1) 


1011 


Subtract with Borrow (A - B - 1) 


1100 


Logical AND (A AND B) 


1101 


Logical OR (A OR B) 


1110 


Logical XOR (A XOR B) 


1111 


Logical XNOR (A XNOR B) 



Different flags are generated depending upon the result of 
operations. 

B. Arithmetic and Logic Unit Design 

To support all sixteen operations mentioned in section A, 
different modules are designed. 

Here six operations are similar in nature and can be 
designed using similar blocks; namely Add, Subtract, Add 
with carry, Subtract with Borrow, Increment, and 
Decrement. As we know that subtraction can be 
implemented using 2"s complement, here we have used this 
property of subtraction and implemented it using adder 
block. Say we have to implement A - B, this can be 
implemented as (A + (2"s complement of B)). Here we have 
reused FPGA resources by using same adder block. 
Similarly increment (A + 1), Decrement (A - 1), Add with 
carry (A + B + 1) and Subtract with borrow (A - B - 1) is 
implemented using adder and 2"s complement block. This 
way above mentioned six operations are implemented using 
single adder and 2"s complement block, and hardware 
resources are conserved. The high level block diagram of 
ALU is shown in figure 1 
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Fig. 1 . High Level Block diagram of optimized arithmetic and logic unit 
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Different modules are designed to perform different 
operations. At a given time only one operation can be 
performed depending upon the value of „sel" (selection line to 
select a particular operation). Here we can utilize this property 
of ALU to our benefit, we enable one block at a given time 
and disable all others blocks, this will stop the switching 
activities of disabled blocks and help in reducing dynamic 
power consumption ref fig 2. This way dynamic power 
consumption of ALU can be reduced and makes our design 
more energy efficient. 
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Fig. 2. High Level Block diagram of optimized arithmetic and logic unit 
(ALU); here Complement A is currently selected operation and as 
shown, only complement A block is enabled and all other blocks are 
disabled; this will reduce switching activities and will result in 
reduction of dynamic power consumption. 



Figure 3: Top level Schematic of Arithmetic and logic unit with tri-state 
logic 

Figure 3 shows the top level schematic of arithmetic and logic 
unit which has five inputs to it, A and B are 8 - bit data inputs 
or operands to the ALU, Operation is a 4 - bit input by which 
any of the operation out of 16 can be selected, Clk is the clock 
input and g_reset is a reset pin by which the ALU can be 
reset, it is mainly used in the shift_l operation. 




Figure 4: RTL Schematic of Arithmetic and logic unit with tri-state logic 
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power consumption inside FPGA is reduced and this makes 
III RESULTS our design more energy efficient. 



We have used Xilinx 14. li to implement the design on 90nm 
Spartan 3 FPGA xc3s50-5pq208. Xpower analyzer is used to 
perform power analysis. ISIM is used to perform behavioral 
simulation. 

A. Resource Utilization 

Table 2 shows the resource utilization summary. In this work we 
have reused the hardware resources such as; we have used a 
single block for adder, for the implementation of six different 
operations addition, addition with carry, subtraction, subtraction 
with borrow, increment and decrement. This way the hardware 
requirement of the design comes lower. 

TABLE 2 

Resource utilization summary 



Frequency 


Clock 

(mW) 


Logic 
(mW) 


Signals 
(mW) 


IO 

(mW) 


Dynamic 
Power 
(mW) 


Previous 
Design 


100 Mhz 


1 


0 


1 


0 


2 


3 


lGhz 


8 


5 


8 


2 


23 


28 


10 Ghz 


76 


26 


58 


24 


184 


NA 


100 Ghz 


761 


31 


384 


243 


1418 


1725 


IThz 


7608 


31 


3589 


2432 


13659 


16812 



C. Behavioral Simulation 

We have xilinx ISIM for behavioral simulation; we tested our 
design with different values of input and in all conditions our 
ALU is behaving as per the specifications. Figure 5 shows the 
behavioral simulation of optimized ALU. A and B are our two 
inputs, elk is used as clock source, Z is output port, operation is 
input by which any operation out of sixteen can be selected and 
g_reset is used to reset the ALU and it clears all registers inside 
ALU; output Z after reset is 0. All the values are depicted in 
hexadecimal. Refer figure 3, all the possible values of operation 
(0-F) are applied and the output Z can be seen under all sixteen 
conditions. 

D. RTL Schematic 

Figure 4 shows the RTL schematic of optimized low power 
arithmetic and logic unit. 

E. Conclusion 

ALU is the core of processor, and optimizing ALU can 
significantly improve the performance of processor. In this 
work we worked in order to reduce power consumption and 
resource utilization of FPGA. As we can conclude from power 
report that by disabling the inactive blocks, dynamic power 
consumption can be significantly reduced; this is because of 
decrease in switching activities inside ALU. Next improvement 
we tried to implement is reduction in FPGA resource usage; we 
removed few blocks such as subtract, increment, decrement, 
add with carry and subtract with borrow and implement all 
these functions using single adder and 2"s complement b lock. 
This fulfills our two purposes; first, reduction of FPGA 
resource usage and second reduction in power consumption. 



Logic Utilization 


Used 


Number of Slice Flip Flops 


27 


Number of 4 input LUT"s 


165 


Number of occupied Slices 


113 


Number of bonded IO 


30 



B. Power Report 



F. Future Scope 

In this work we implemented our design on 90nm Spartan 3 
FPGA. One possibility of improvement is designing it on 28nm 
FPGA. Next possibility is optimizing the code; this will 
improve the performance of ALU. The size of the ALU can 
also be increased to 16, 32 or 64 bit. Our amendments will 
show a better performance on larger size. 



We used Xpower analyzer tool from xilinx to calculate power 
consumption of device. The design is operated at different 
frequencies and power consumption is noted accordingly. 

Here we haves the concept of tri-state logic, we have forced to 
turnoff all the blocks which are not required in current selected 
operation, eg. Suppose we want to perform complement B 
operation, this operation only requires block"complement B" 
and rest of the block are not required so 

we tri-stated the inputs to rest of the block, this will not allow 
the output capacitor to discharge and hence the switching 
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Figure5. Behavioral simulation of optimized low power ALU. A and B are input, 
Z is output, operation is input by which any on e operation out of sixteen can be 
selected. Here all sixteen possibilities of operation (0 - F) are applied. #all values 
in hexadecimal 
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